Rank in Wordlist | Frequency | Word |
---|---|---|
3886 | 41 | છે,જે |
5140 | 30 | 10,000 |
5355 | 29 | છે,અને |
5652 | 27 | 1,000 |
5654 | 27 | 20,000 |
7378 | 20 | 50,000 |
7647 | 20 | ૧૦,૦૦૦ |
8921 | 16 | 2,000 |
9423 | 15 | 3,000 |
11351 | 13 | ૨૦,૦૦૦ |
Rank in Wordlist | Frequency | Word |
---|---|---|
72592 | 1 | Kandukondain)''માં |
Rank in Wordlist | Frequency | Word |
---|---|---|
8015 | 19 | ૫% |
9961 | 14 | 80% |
10607 | 14 | ૫૦% |
10623 | 13 | 20% |
10630 | 13 | 50% |
12169 | 12 | ૧૦% |
12183 | 12 | ૪૦% |
12185 | 12 | ૬૦% |
12203 | 11 | 70% |
13202 | 10 | 30% |
Rank in Wordlist | Frequency | Word |
---|---|---|
44382 | 2 | AT&T |
67702 | 1 | 1&એનબીએસપી |
70968 | 1 | AT&Tએ |
Rank in Wordlist | Frequency | Word |
---|---|---|
69932 | 1 | 400–$500 |
74191 | 1 | US$1000 |
74192 | 1 | US$200 |
74193 | 1 | US$354 |
74194 | 1 | US$6 |
74195 | 1 | US$600 |
74196 | 1 | US$627.70 |
74197 | 1 | US$91.75 |
74198 | 1 | US$963 |
131753 | 1 | પેટા-$80,000 |
Rank in Wordlist | Frequency | Word |
---|---|---|
192 | 673 | ." |
Rank in Wordlist | Frequency | Word |
---|---|---|
4465 | 35 | .' |
27082 | 4 | Hitler's |
33236 | 3 | D'elles |
33345 | 3 | Philosopher's |
38539 | 3 | પિપલ'સ |
40856 | 3 | રા'લાખા |
44504 | 2 | D'eux |
44545 | 2 | Earth's |
44660 | 2 | India's |
44907 | 2 | Shakespeare's |
Rank in Wordlist | Frequency | Word |
---|---|---|
44447 | 2 | C++/CLI |
52514 | 2 | ઝીપ+4 |
67608 | 1 | 0-4-0+0-4-0 |
67885 | 1 | 1001+મુ |
69066 | 1 | 2+2 |
71432 | 1 | CTRL+TAB |
73028 | 1 | N+1 |
73221 | 1 | OSPD+OSW |
73526 | 1 | R.+Rahman |
74147 | 1 | U+0370 |
Rank in Wordlist | Frequency | Word |
---|---|---|
3944 | 40 | અને/અથવા |
8927 | 16 | TCP/IP |
11659 | 12 | ટીસીપી/આઈપી |
19950 | 6 | અને/કે |
22898 | 5 | http://bharatdiscovery |
22899 | 5 | http://news |
23579 | 5 | કિમી/કલાક |
26847 | 5 | ૨/૩ |
26856 | 5 | ૩/૪ |
26872 | 4 | 1/2 |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $
" ' + * = / _
Depending on the language some of these characters may be allowed within words, other will not. If words with forbidden characters do not have very low frequency there might be a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots